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Abstract 

Phylogeographic studies have described a reduced genetic diversity in Native American populations, indicative of one or 
more bottleneck events during the peopling and prehistory of the Americas. Classical sequencing approaches targeting the 
mitochondrial diversity have reported the presence of five major haplogroups, namely A, B, C, D and X, whereas the advent 
of complete mitochondrial genome sequencing has recently refined the number of founder lineages within the given 
diversity to 1 5 sub-haplogroups. We developed and optimized a SNaPshot assay to study the mitochondrial diversity in pre- 
Columbian Native American populations by simultaneous typing of 26 single nucleotide polymorphisms (SNPs) 
characterising Native American sub-haplogroups. Our assay proved to be highly sensitive with respect to starting 
concentrations of target DNA and could be applied successfully to a range of ancient human skeletal material from South 
America from various time periods. The AmericaPlex26 is a powerful assay with enhanced phylogenetic resolution that 
allows time- and cost-efficient mitochondrial DNA sub-typing from valuable ancient specimens. It can be applied in addition 
or alternative to standard sequencing of the D-loop region in forensics, ancestry testing, and population studies, or where 
full-resolution mitochondrial genome sequencing is not feasible. 
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Introduction 

Population genetic studies on modern-day Native American 
populations have described the presence of five haplogroups (hgs), 
termed A, B, C, D and X [1-3]. These five hgs are shared with 
East Asian populations and support an entry route to the Americas 
via the Bering landmass. However, Native American populations 
can be distinguished from their East Siberian source populations 
by exhibiting distinct sub-haplogroups (sub-hgs), which can only 
be found in the Americas. These so-called 'founder lineages' have 
been used to describe the demographic history of Native American 
populations and to shed light on the timing of the entry into and 
spread throughout the Americas [4—6] . The fact that the mtDNAs 
of all human populations native to the Americas can be assigned to 
one of the founder lineages pertains to stochastic events that would 
have affected the initial colonizers of the Americas [7] . The low 
genetic variation found in modern Native American groups is 
believed to be due to either population bottlenecks or genetic drift 
[8,9]. 

Most mitochondrial DNA (mtDNA) studies on prehistoric 
American populations involve sequencing of the D-loop, which 
contains Hypervariable Regions 1 and 2 (HVRl and HVR 2 
respectively), to describe a sequence haplotype, from which the hg 



can be inferred [10,11]. Sequencing of the HVR regions of 
mtDNA was relatively cost-effective and less time consuming than 
full mtDNA sequencing, and is therefore still the method of choice 
for many labs which study human populations [11]. Yet not all 
lineages harbour enough variation in the D-loop from which to 
infer a sub-hgs at a deeper level than the overall hg, let alone a 
specific founder lineage [11-14]. As a result, many past and 
present studies on Native American population history have been 
restricted to the information gained from the distribution of the 
major five Pan-American hgs. 

The coupling of multiplex polymerase chain reaction (PGR) 
with a Single Base Extension (SBE) reactions, based on the 
established SNaPshot (Applied Biosystems) or minisequencing 
principle, has been widely used to design panels of single 
nucleotide polymorphisms (SNPs) for forensic and anthropological 
studies [15]. It has also found wide use in population genetic 
studies focussing on mtDNA and Y-chromosome SNPs, either 
including SNPs with a global representation or via a targeted 
selection of characteristic SNPs representing specific geographic 
regions [13,16-21]. The design of a SNP panel including those 
markers defining the 15 American founder lineages described by 
Perego et al. [22] and more had not been attempted, although 
'Multiplex 3' in van Oven et al. 201 1 [23] covered 12 out of these 
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15. The primary aim of this study was therefore to design a novel 
SNaPshot assay that enables a fast and cost-efiicient high- 
resolution typing of the majority of known Native American 
sub-hgs by targeting 26 characteristic SNPs. Our goal was to 
develop an assay that is universally applicable to accommodate the 
specific needs of damaged and degraded DNA in ancient DNA 
work and forensics. Selective sequencing of the SNP regions of 
interest not only allows for flexibility in the number and choice of 
SNP sites but also allows (with reservations) the design of ultra- 
short amplicon lengths (50-80 bp) suitable for degraded DNA 
typing [13], whik- using far ktss DNA than traditional sequencing 
methods or SNP-typing in individual singleplex PCRs. This is of 
great importance in forensic and ancient DNA studies where 
sample DNA is a limited resource [23,24]. The secondary aim was 
to develop an assay that could complement an established assay 
with a global set of SNPs (GenoCore22, see [24] but also [23]) and 
at the same time provide a fast and efficient screening tool that 
allows the assessment of overall sample quality (presence of very 
short fragments of endogenous mtDNA and absence of contam- 
inant hgs) for further use in mitochondrial genome sequencing via 
DNA library preparation and targeting enrichment techniques, 
e.g. [2,5-27]. 

Materials and Methods 

AmericaPlex26 SNP selection 

We developed a multiplex SNaPshot reaction targeting 26 SNP 
sites in total including characteristic SNPs of the four major Pan- 
American sub-hgs A2, B2, Gl and Dl, as well as SNP sites for the 
minor Pan-American lineages C4c, D2a, D4h3a and X2a [6]. The 
initial choice of SNPs was based on a study by Perego et al. [22] 
describing 15 American founder lineages. Additional SNP sites 
were chosen for sub-hgs within each major hg based on the most 
up-to-date mtDNA phylogeny available at the time (phylotree.org, 
mtDNA tree Build 13, 28 Dec 2011) in order to enhance the 
discriminating power of the assay. For sub-hgs defined by more 
than one characteristic SNP we employed selection criteria during 
the primer design stage based on the abihty to design primers with 
high specificity in the short flanking region around the SNPs, and 
under a consensus-melting temperature for all pairs in a multiplex 
environment. 

Presented below is a summary of each major Native American 
sub-hg, their distribution throughout the Americas, as well as the 
SNP sites chosen to represent the hg and their respective sub-hgs. 
The representative SNPs typed in the AmericaPlex26 are given in 
parentheses and a simplified tree illustrating the phylogenetic 
relationship is shown in Figure 1. 

Haplogroup A2 (G12007A) is found throughout the Americas, 
but its derivatives A2a (C3330T) and A2b (T11365C) are mainly 
found in the Northern parts of North America in Inuit, Na-Dene 
and Siberian populations such as Kor\'aks and Chukchi [22,28- 
30], whereas particular subgroups of A2a were also reported from 
Athapaskan territories in the Southwest [30]. A16265G (defining 
A2b in [22]) was also added to the assay as it was further resolved 
to represent sub-hg A2bl, which can be found in Eskimoan- 
speaking populations (such as the Inuit and Yupik) across the 
Arctic [31]. 

Haplogroup B2 (C11177T) is widely distributed throughout 
the Americas and is nested within hg B4b (G4820A), which has a 
largely Asian distribution. Studies on modern-day populations 
describe sub-hgs within B2, which are prevalent in specific 
geographic areas within the American continents and thus 
phylogeographically informative [32]. We included diagnostic 
SNPs for sub-hgs B2a (G16483A, Nortii/Central), B2b (G6755A, 



ubiquitous), B2c (A7241G, predominantiy Gentral), B2d (T8875G, 
Central/South), B2e (G61 19T, South) and B2f (T10535C, Nortii/ 
Central) to monitor this phylogeographic pattern in ancient Native 
American populations through time. 

Haplogroup C is represented in the Americas by sub-hgs 
Gib, Clc and Cld [22,32], whereas sub-hgs Cla and Cle are 
Siberian/East Asian and European sister-clades, respectively. Sub- 
hg Gib (A493G) can be found throughout South America. Sub-hg 
Glc is most frequent in Mexico [22] and was split into sub-hgs 
Glcla (A12978G) and Glc2 (G14356T), since the immediate 
flanking region of the two Clc SNPs defining (G1888A and 
G15930A) were not suitable for primer design. A recent study by 
Perego et al. [22] has further resolved Gentral American sub-hg 
Cld (A16051G), which now includes sub-hg Gldl (G7697A). 
Minor Pan-American hg C4c (G14433T) was recentiy discovered 
in an ancient sample from British Columbia, and was found to be 
one of the founding lineages of the Americas based on coalescent 
age estimates [33,34]. 

Haplogroup D is found in the Americas as four distinct sub- 
hgs Dl (G2092T), D2a, D3 and D4h3a [6,35]. Sub-hg D2a has 
been found in Sireniki, Yuit, Aleut and Chukchi populations in 
Siberia and Aleuts from the Commander Islands [31,36,37], and 
has been further resolved into sub-hgs D2al (A9667G; including 
tiie Saqqaq Paleo-Eskimo [31]), D2a2 (G4991A) and D2b 
(A9181G) [22]. Recent studies have further revealed sub-hg D3 
to be more derived than previously thought and nested within the 
larger branch D4bl (C10181T), which currenfly encompasses 
three sub-hgs within North American and Siberian native 
populations: D4bla, D4blb'd, and D4blc [31]. Therefore, we 
chose the ancestral SNP defining hg D4bl for our panel (See 
mtDNA tree BuUd 13, 201 1). In addition, sub-hg D4h3a (G6285A) 
is distributed along the Pacific coast of the American continents [6] 
but most frequent in South America. 

Haplogroup X2a (A8913G) has only been found in a limited 
number of samples in North American populations as compared 
to those of A2, B2, C 1 and D 1 , and is therefore described as minor 
founding lineage in this paper [8,35]. 

Lastly, SNP site T14783G was included as control to define 
macro-hg M, which encompasses hgs C and D. In contrast, this 
SNP retains the ancestral state in hgs A, B and X, which belong to 
macro-hg N. 

Primer and probe design 

PGR and SBE primers were designed and quality-controlled 
using default settings and features in the software package 
Geneious v5.2 (Geneious version (5.2) created by Biomatters. 
Available from http:/ /www.geneious.com/) and Batchprimer3 
vl.O [38], both based on the program primerS [39], generally 
following the guidelines set out in Sanchez et al. 2006 [40]. 
Amplicon sizes were deliberately kept smaller than 90 bp in length 
to allow amplification of highly fragmented DNA as typical in 
forensics and ancient DNA studies. Given how short the flanking 
regions of each SNP were, which already constrained our selection 
of suitable SNPs, we could not consider potential polymorphic 
sites in these areas nor nuclear insertions, and rehed on empirical 
testing of PGR primer efiiciency. SBE primers were then ranked 
according to quality score and orientation (forward or reverse) for 
efficient use of fluorescent dyes and fragment length spacing. The 
latter was adjusted to 4 bp by adding poly-CT tails to the 5 'end of 
each SBE primer (Table 1) [41]. 

Ethics statement 

AH necessary permits were obtained for the described study, 
which complied with all relevant regulations. Permissions to 
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Simplified phylogeny of SNPs typed in AmericaPlex26 assay 
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Figure 1. Simplified pliylogeny in related to the Reconstructed Sapiens Reference Sequence (RSRS [52]) and typing scheme of the 
26 SNPs targeted in the AmericaPlex26. Sub-haplogroups, which can be unambiguously assigned, are shown in blue (blue). Basal hgs, which 
cannot be unambiguously assigned, are also shown (black) in order to illustrate the phylogenetic relationship, but also the inherent limitations of our 
assay. The phylogenetic position of the revised Cambridge Reference Sequence (rCRS [53]) is indicated within macro-hg N. SNPs with a SBE primer in 
reverse direction targeting the opposite strand are given in Italics. 
doi:1 0.1 371/journal.pone.0093292.g001 
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collect, export and analyze ancient Peruvian specimens from the 
Huaca Pucllana site were granted by the Ministry of Culture (the 
former National Institute of Cultural Heritage - INC) and the 
National Museum of Archaeology, Anthropolog)- and History of 
Peru (MNAAHP) and are available on request (ACTA No 01 7- 
2010-ARMC-MNAAHP-MC and Resolucion Viceministerial 
No. 1 20-20 10-VMPCIC-MC). No specific permits were required 
for the modem control samples when solely used for methodo- 
logical validation (waiver from The Human Research Ethics 
Committee (HREC) at the University of Adelaide). Swab samples 
were nevertheless collected using written informed consent. 

Samples and DNA extractions 

DNA from modern control samples (AC, GV) was extracted 
from cheek swabs using QIAamp DNA Mini Kit (Qiagen) a 
following the manufarturer's instructions. Ancient samples were 
collected by MIBR, BL and WH under DNA-free conditions at 
the Museo de Sitio Huaca Pucllana, Calle General Borgono 
cuadra 8 s/n, Miraflores, Lima, Peru, where the samples are 
stored. Sample preparation, DNA extractions and PCR amplifi- 
cation from ancient samples were performed at the Australian 
Centre for Ancient DNA in Adelaide, Australia, applying 
established methods and authentication criteria as described 
previously [26,42,43]. In brief, we used an in-house silica 
extraction method, detailed in [26], to extract DNA from two 
independent samples per individual. PCR amplifications from 
each extract and direct sequencing of the HVR-I were performed 
using four overlapping primer pairs with reaction conditions 
described in [42,44] . Details of the four primer pairs are given in 
(Table 2). 

Multiplex PCR amplification 

PCR amplifications were carried out in a final reaction volume 
of 12.5 |J,1 consisting of 0.5 [iL DNA sample (3 |iL for ancient 
DNA), Ix PCR Gold Bufier, 6.5 mM MgClg, 0.1 U AmphTaq 
Gold DNA polymerase (all Applied Biosystems) 1.25 mM dNTP 
solution (Bioline Pty Ltd), (0.8 \ig RSA for ancient DNA samples), 
and a primer mix consisting of 26 primer pairs, with concentra- 
tions given in Table 1 . PCR was carried out on a Tetrad 2 Peltier 
Thermal Cycler (Bio-Rad Laboratories) using the following 
conditions: 95°C for 6 min and 30 cycles (45 cycles for ancient 
DNA samples) of 95°C for 30 s, 55°C for 30 s, 65°C for 30 s, and 
a final extension time at 65°C for 6 min. Amphfication success was 
monitored via gel electrophoresis on an 3.5% agarose gel (100 V 
for 40 min; Hyperladder V DNA size ladder (Bioline Pty Ltd)). 
PCR products were purified by mixing 5 |il of PCR reaction with 
1 U ExoSAP-IT (Thermo Fisher Scientific Australia Pty Ltd), 
followed by incubation at 37°C for 50 min, 80°C for 15 min and 
15°C for 10 min. Single Base Extension reactions consisted of a 
final volume of 5 |iL containing 1 |iL PCR product, 2.5 |iL 
SNaPshot ready reaction mix (Applied Biosystems), and 0.5 |iL 
extension primer mix (individual concentrations are given in 
Table 1). ThermocycUng of the SBE reactions was performed in a 
Tetrad 2 Peltier Thermal Cycler (Bio-Rad Laboratories) with the 
following conditions: 96°C for 10 s; followed by 35 cycles of 55°C 
for 5 s and 60°C for 30 s. SBE products were purified by adding 
1 U Shrimp Alkaline Phosphatase (Thermo Fisher Scientific) to 
the reaction solution and incubating it at 37°C for 50 min, 80°C 
for 15 min and 15°C for 10 min. 

Capillary electrophoresis was performed on a 3 1 30 xl Genetic 
Analyser (Applied Biosystems) using POP-6 polymer and a 
customised run module, by adding 1 |iL sample DNA to 
18.5 nL Hi-Di Formamide and 0.5 |xL GeneScan-120 LIZ 
internal size standard (Applied Biosystems). Electropherograms 



were analysed using the software Genemapper ID version 3.2.1 
software (Applied Biosystems) applying custom panel and bin 
settings available on request. 

Sensitivity tests 

We performed sensitivity studies using serial dilutions of 1, 1:10, 
1:100, 1:1000 and 1:10,000 of DNA from a buccal swab sample 
from a lab member (AC). The mtDNA copy number of the 
modern sample was determined through qPCR using the SYBR- 
Green kit (Qiagen), targeting a short 77 bp fragment of human 
mitochondrial DNA with primer pair L13258 and II13295 [45]. 
Serial dilutions were treated as separate samples, and each sample 
was analysed in triplicate. The qPCR reaction was performed in a 
total reaction volume of 10 nL consisting of 1 (xL of each sample 
dilution, 2x Brilliant SYBR Green Master Mix and 0.1 ^M of 
each primer. The qPCR were carried out on a Rotor-Gene Q, 
Real-Time PCR cycl(;r (Qiagen) with thermocycling conditions as 
follows: 95°C for 3 min, followed by 45 cycles of 95°C for 10 s, 
58°C for 20 s and 72°C for 20 s. 

Results and Discussion 

Optimization of the multiplex protocol 

The AmericaPlex26 assay was initially tested with default 
concentrations of 0.017 |iM for each primer (3 |tL of 25 |iM stock) 
and 0.015 jjM for each SBE primer (3 (J,L of 50 [iM stock) to assess 
the generic efiiciency of primers or probes when used in the multiplex 
assay. Twenty-two out of 26 SNP sites could be readily amplified, 
albeit with highly variable peak heights across the assay. Primers and 
probes for the four problematic SNP sites were each tc-sted in 
singleplex PCR and SNaPshot reactions to ensure they performed 
individually as expected. If the SNP fragment were successfully 
amplified in the singleplex PCR, concentrations of the primer would 
be doubled in the following multiplex PCR reaction mix. 

We cliosc' 3000 relative fluorc'sccncc units (rfu) as a default 
average peak height based on the ancestral allele status observed in 
our European modern control sample (AC), and calculated the 
percentage difierence between peaks and the 3000 rfu average. 
Multiplex primer concentrations were adjusted according to this 
percentage difference to allow amplification of problematic SNP 
sites and to balance the peak heights of those that did amplily. 
Based on poor performance of the primer pair chosen to amplify 
the Clb SNP site (A493G), we performed a second round of 
balancing primer concentration with a new primer pair for this 
site. 

To further refine the balance in peak height, the concentration 
of some SBE extension primers was adjusted to the final 
recommended concentrations given in Table 1. Changes to probe 
concentrations resulted in a more balanced electropherogram and 
amplification of all 26 SNP sites using modern buccal swab and 
ancient DNA samples (Figure 2). 

Sensitivity studies 

The amount of mitochondrial DNA was measured for a modern 
sample (1,171,699 copies/(iL) and four serial dilutions of 1:10, 
1:100, 1:1000 and 1:10,000 using real-time quantitative PCR 
(Figure 3). A near complete SNP profile could be observed for 
serial dilutions up to 28,278 copies/|iL DNA (1:100), which is 
similar to other published multiplex assays [23,24,46]. 

Method application on ancient samples 

The AmericaPlex26 assay was tested on ancient samples from 
three successive pre-Columbian cultures from the Huaca Pucllana 
archaeological site in Lima, Peru. They included samples from the 
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Table 2. Details of primers used for standard HVR-I amplification and sequencing. 





Primer 


Primer Sequences 5' to 3' 


Length in bp (IncL/excl. primer) 


Reference 


LI 6055 


GAAGCAGATTTGGGTACCAC 


1 26 (87) 


[54] 


H16142 


ATGTACTACAGGTGGTCAAG 




[55] 


L16117 


TACATTACTGCCAGCCACCAT 


162 (115) 


[42] 


HI 6233 


GCTTTGGAGTTGCAGTTGATGTGT 




[42] 


LI 6209 


CCCCATGCTTACAAGCAAGT 


179 (138) 


[54] 


HI 6348 


ATGGGGACGAGAAGGGATTTG 




[42] 


LI 6287 


CACTAGGATACCAACAAACC 


162 (122) 


[54] 


H16410 


GCGGGATATTGATTTCACGG 




[54] 



doi:l 0.1 371 /journal.pone.0093292.t002 



Early Intermediate (n = 20; 200-600 AD), the Middle Horizon 
(n = 20; 600-1000 AD) and the Late Intermediate (n= 12; 1000- 
1476 AD) [47] plus an Early Medieval European samples as 
control for the ancestral state. Samples from each period varied in 
the state of preservation, due to differences in mortuary customs. 
From our test dataset of 52 samples in total, we were able to 
unambiguously type 29 samples (56%) (Table 3, Figure 4). A 
typing result was considered reliable when two samples from the 
same individual could be unambiguously assigned to the same sub- 
hg in two independent experiments. 

We subsequently compared the AmericaPlex26 assay results to 
our previous attempts at amplifying and sequencing the mito- 
chondrial HVR-I with four overlapping primer pairs and found 
that the AmericaPlex26 assay improved the typing efficiency from 
ancient samples (Table 3, Figure 4). For example, the Amer- 
icaPlex26 assay allowed reliable SNP typing for eleven Early 
Intermediate (55%) and ten Late Intermediate samples (83%), 
whereas HVR-I sequencing gave reliable sequence haplotypes for 



seven (35%) and seven (58%) samples, respectively. For example, 
HVR-I sequencing for samples 10802A and 10803A failed, while 
the AmericaPlex26 assay revealed specific hg B2 (Table 3). 
Samples from the Middle Horizon culture were in general less well 
preserved, resulting in eight consensus sub-hg calls (40%) using tiie 
AmericaPlex26 assay, whereas HVR-I sequencing did not produce 
any reliable sequence haplotype from the sample replicates (0%). 
This highlights the genotyping power of our assay when dealing 
with challenging samples. 

Importandy, SNP typing with the AmericaPlex26 assay also 
gave a higher resolution compared to traditional HVR-I 
sequencing. For example, samples 10809A and 1081 OA of the 
Early Intermediate period were assigned to the major hg CI by 
HVR-I sequencing, yet the AmericaPlex26 assay allowed further 
resolution to sub-hg Clb. In addition, while many of the HVR-I 
results from Late Intermediate samples remained tentative, i.e. 
non-reproducible, the AmericaPlex26 assay provided reliable and 
specific sub-hgs for both replicates (Table 3, Figure 4). Taking all 



A 
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500 

B 

2000; 
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1000- 
500 



493A 




7241A i1605lA Ancient South American sample ([ng B2, xB2a-f) 120 7G 

16256A 891 3A 



12978A 




6119G 



4820T 



85C . 



14356G 



16483C 

7697CI 



I 

t14783T ■ 
1 1-ra5T 



Ancient European control sample (hg HV) 



12007G 



7241 A 



16051 A 
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9181Ti3330C 
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1 „ 14356G 

' I i I \ • 

887511 1 I I 



14783T 



Figure 2. Electropherograms of two ancient examples representing tKie optimized AmericaPlex26. Panel A shows a South American 
sample and panel B an ancient European sample illustrating the ancestral state of all 26 SNPs. The two SNPs sites defining hg B2 are highlighted in 
red. Note the ancestral state at SNP sites defining B2 sub-haplogroups B2a (16483), B2b (6755), B2c (7241), B2d (8875), B2e (611 9) and B2f (10535). 
doi:1 0.1 371/journal.pone.0093292.g002 
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Figure 3. Electropherograms from the sensitivity tests showing four serial dilutions of template DNA with a starting copy number 
of 1,171,699 copies/fiL. Note the increasing number of locus dropout as template DNA concentration decreases. 
doi:1 0.1 371 /journal.pone.0093292.g003 



results together, the AmericaPlex26 assay showed a sigiiificaiidy 
higher success rate when compared to the standard HVR-I 
sequencing (p<0.0001, Wilcoxon matched-pairs signed rank test; 
Figure 4). This is likely due to the difference in amplicon sizes 
between the two methods. 



The overall effectiveness of the assay 

Overall, the effectiveness of the multiplex SNaPshot method in 
analysing ancient DNA lies in the fact that it only requires minimal 
flanking regions either side of the SNP, which in theory allows the 
design of very short overall amplicon sizes [40,48], 56-90 bp in 



Early Intermediate (N=20) ■ Middle Horizon (N=20) ■ Late Intermediate (N=12) "Total (N=52) 




NewWorldPlex26 



Figure 4. Genotyping success of the AmericaPlex26 assay compared to standard HVR-I sequencing via four overlapping amplicons. 

Tfie success rate is given in percentage of unambiguous genotype calls for each of the two methods per cultural horizon and all results combined 

(black bars, p<0.0001). 

doi:1 0.1 371/journal.pone.0093292.g004 
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Table 3. Direct comparison of results for HVR-I sequencing and AmericaPlex26 SNP typing assay for samples unambiguously 
typed using the AmericaPlex26 assay. 



Individual 


Museum no. 


Samples 


HVR 1 hg 


AmericaPlex26 


Consensus 


El 1 


A06 95/96 


10802A 


3 


B2 


B2 






10803 A 


? 


B2 




El 2 


A06 79/01 Ind2 


10804A 


Dl 


Dl? 


Dl 






10805A 


Dl 


M 




El 3 


A06 76/96 


10809A 


CI 


Clb 


Clb 






1081 OA 


CI 


Clb 




El 4 


A15 06/00 


1081 3 A 


B4 


B2 


B2 






10814A 


B4 


B2 




El 5 


A06 01/02 


10817A 


CI 


Clb 


Clb 






10818A 




Clb 




El 6 


A6 68/96 


10821A 


B4 


B2 


B2 






10822A 


B4 


B2 




EI7 


A06 90 Indl 


10787A 


7 


B2 


B2 






10788A 




B2 




El 8 


A20 05/09 


10789A 


B4 


B2b 


B2b 






10790A 


7 


B2b 




El 9 


A20 03/07 Indl 


10791 A 


A2 


A2 


A2 






10792A 


A2 


A2 




El 10 


A06 82/96 


10793 A 


B4 


82 


B2 






10794A 


B4 


B2 




El 11 


A06 77/96 


1079eA 


B4 


B2b 


B2 






10797A 


7 


B2 




El 12 


A06 79/96 Indl 


10800A 


B4 


B2b 


B2 






10801 A 


B4 


B2 




MH 1 


A20 08/08 Ind2 


10733A 


7 


Clb 


Clb 






10734A 




Clb 




MH 2 


A20 CF003/09 


10741 A 




B2 


B2 






10742A 




B2 




MH 3 


A20 07/08 Ind4 


10749A 




B2 


B2 






10750A 




B2 




MH 4 


A20 05/08 


10753 A 




B2 


B2 






10754A 


B4? 


B2 




MH 5 


A20 18/08 


10765A 




Clb 


Clb 






107e6A 




Clb 




MH 6 


A20 01/09 Indl 


10772A 


CI 


Clb 


Clb 






10773 A 


7 


Clb 




MH 7 


A20 01/09 Ind2 


10774A 


7 


A2 


A2 






10775A 




A2 




MH 8 


A20 04/07 Ind2 


10778A 


D? 


A2 


A2 






10779A 




A2 




LI 1 


A15 01/02 


10709A 


CI 


M 


CI 






10710A 


CI 


Clb 




LI 2 


AO cfl4 ind-1/98 


10712A 


B4 


B2 


B2 






10713A 


B4 


B2 




LI 3 


AOl CF16/98 


10715A 


7 


82 


B2 






107ieA 




B2 




LI 4 


AO CF15/01 


10717A 


CI 


M 


M 






10718A 


? 


M 
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Table 3. Cont. 





Individual 


Museum no. 


Samples 


HVR 1 hg 


AmericaPlex2e 


Consensus 


LI 5 


A15 CF36/01 


10719A 


B4 


B2 


82 






10720A 


B4 


B2b 




LI 6 


A15 Sin Contexto 


10724A 


CI 


Clb 


Clb 






10725A 


CI 


Clb 




LI 7 


A3 CFOl/04 


1072eA 


3 


B2b 


82 






10727A 




B2 




LI 8 


AO 08/98 


10728A 


B4 


82 


82 






10729A 


B4 


B2 




LI 9 


A15 02/02 


10730A 


B4 


B2 


82 






10730Y 


B4 


B2b 




LI 10 


AO 56/97 


10731A 


CI 


Clb 


Clb 






10731Y 


CI 


Clb 





Consensus haplogroups were called based on last common SNP from both replicates from independent extractions, and minimum peak size >50 rfu. (?)/(—}: 

Insufficient or no sequence information; El: Early Intermediate; MH: Middle Horizon; LI: Late Intermediate. 

doi:10.1371/journal.pone.0093292.t003 



our case. This makes it suitable for the extremely fragmented and 
damaged state of ancient DNA, while the multiplex approach 
maximises the amount of information that can be gained per PGR 
[15,49,50]. As such, SNaPshot typing is able to generate results for 
samples for which traditional sequencing methods often fail with 
ancient and/or degraded DNA, as they require longer fragment 
lengths to be cost-effective [51]. Multiplexing also allows the 
combination of many informative SNP sites into one reaction, 
which are otherwise spread across longer sequence regions. On its 
own, the multiplex PGR and SNaPshot method is time- and cost- 
effective, and requires substantially smaller amounts of valuable 
DNA extract compared to HVR-I sequencing, as fewer individual 
reactions are needed from preparation of the multiplex PGR to 
capillary electrophoresis [48,50]. 

We show that the AmericaPlex26 can be used to complement or 
expand upon standard mtDNA sequencing approaches for ancient 
Native American populations, and especially for ancient samples 
where DNA preservation does not allow amplification of longer (> 
100 bp or more) DNA molecules. It efficiently and economically 
targets characteristic SNPs from the coding region of mtDNA [15] 
in order to corroborate HVR-I sequencing results and to define a 
particular sub-hg [14]. Alternatively, the AmericaPlex26 can be 
used in addition to global mtDNA SNP multiplexes, such as the 
GenoCore22 and others [23,24]. Moreover, it is flexible enough to 
add newly discovered SNPs/lineages in order to enhance sub- 
regional resolution (see e.g. [32,35]. 

In our experience, the new method provided an extremely 
useful one-reaction test to screen larger numbers of degraded 
samples allowing the assessment of the general state of preserva- 
tion, the authenticity of the result (i.e. absence of potential 
contaminating lineages), while at the same time allowing a 



categorisation of potentially interesting sub-hgs. We are currentiy 
using this approach in order to further dissect the phylogenetic 
resolution via DNA library creation and targeted mtDNA 
enrichment and Next Generation Sequencing [25,26]. 

Conclusions 

We present a powerful, optimized SNP assay, which allows 
unambiguous typing of Native American mtDNA Tounder 
lineages' and additional SNPs for further resolution. This short- 
amplicon AmericaPlex26 assay is highly efficient, time and cost- 
effective compared to classical HVR-I sequencing, and allows 
highly resolved SNP typing of degraded DNA samples in forensic 
and ancient DNA work. It is suitable as a qualitative 'screening' 
method to identify samples with sufficient DNA preservation, free 
of contaminants that complicate full mitochondrial sequencing 
(and beyond) via Next Generation Sequencing techniques. 
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